首页> 外文OA文献 >Computational detection and location of transcription start sites in mammalian genomic DNA
【2h】

Computational detection and location of transcription start sites in mammalian genomic DNA

机译:哺乳动物基因组DNA中转录起始位点的计算检测和定位

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Transcription, the process whereby RNA copies are made from sections of the DNA genome, is directed by promoter regions. These define the transcription start site, and also the set of cellular conditions under which the promoter is active. At least in more complex species, it appears to be common for genes to have several different transcription start sites, which may be active under different conditions. Eukaryotic promoters are complex and fairly diffuse structures, which have proven hard to detect in silico. We show that a novel hybrid machine-learning method is able to build useful models of promoters for >50% of human transcription start sites. We estimate specificity to be >70%, and demonstrate good positional accuracy. Based on the structure of our learned models, we conclude that a signal resembling the well known TATA box, together with flanking regions of C-G enrichment, are the most important sequence-based signals marking sites of transcriptional initiation at a large class of typical promoters.
机译:转录是由启动子区域指导的过程,转录过程是从DNA基因组的各个部分制备RNA副本。这些定义了转录起始位点,以及启动子在其下有活性的一系列细胞条件。至少在更复杂的物种中,基因具有几个不同的转录起始位点似乎很常见,这些位点可能在不同条件下具有活性。真核启动子是复杂且相当分散的结构,已证明很难在计算机上检测到。我们表明,一种新型的混合机器学习方法能够为> 50%的人类转录起始位点建立有用的启动子模型。我们估计特异性为> 70%,并显示出良好的位置准确性。根据我们所学模型的结构,我们得出结论,类似于众所周知的TATA盒的信号,以及C-G富集的侧翼区域,是在大多数典型启动子上标记转录起始位点的最重要的基于序列的信号。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号